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Abstract 

In this paper, an estimator of m instants (m is known) of abrupt changes of the parameter of long-range 
dependence or self-similarity is proved to satisfy a limit theorem with an explicit convergence rate for a sample of 
a Gaussian process. In each estimated zone where the parameter is supposed not to change, a central limit theorem 
is established for the parameter's (of long-range dependence, self-similarity) estimator and a goodness-of-fit test 
is also built. To cite this article: J.M. Bardet, I. Kammoun, C. R. Acad. Sci. Paris, Ser. I 340 (2007). 

Resume 

Detection de ruptures du parametre de longue memoire, d'autosimilarite pour des processus gaus- 
siens. Dans ce papier, pour une trajectoire d'un processus gaussien, un estimateur des m points de ruptures (m 
est suppose connu) du parametre de longue memoire ou d'autosimilarite est construit et on montre qu'il verifie 
un theoreme limite avec une vitesse de convergence explicite. Dans chaque zone (estimee) oil ce parametre est 
constant, un estimateur de ce parametre verifie un theoreme limite centrale et un test d'ajustement est egalement 
mis en place. Pour citer cet article : J.M. Bardet, I. Kammoun, C. R. Acad. Sci. Paris, Ser. I 340 (2007). 



Version frangaise abregee Le probleme de detection de points de ruptures moyennant la minimisation 
d'une fonction de contraste donnee a ete etudie depuis le milieu des annees 1990 dans le cadre de proces- 
sus a longue memoire (voir par exemple [9], [11], [12], [13] et [14]). De ces approches, certaines ont ete 
associees a un cadre parametrique, telle que la detection de rupture selon la moyenne et/ou la variance, 
d'autres traitees dans un cadre non parametrique (comme la detection de ruptures selon la distribution 
ou le spectre). Dans la litterature, differents auteurs ont egalement propose des statistiques de test de 
I'hypothese que le parametre est inchange contre le fait que le parametre de longue memoire varie en 
fonction du temps (voir par exemple [2], [7], [10]). A notre connaissance, le cadre semi-parametrique de 
detection de changements en longue memoire ou en autosimilarite n'a ete traite que dans [13] a partir 
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d'une technique basee sur le periodogramme. 

Notre approche est fondoo sur ranalyse par ondelettes, ce qui presente plusieurs avantages : c'est une 
technique non-paramctriquc applicable pour des processus tres generaux, robustc aux tendances poly- 
nomiales et, au moins dans le cadre gaussien, s'accompagnant de tests d'adequation de type simples 
et interessants a utiliser. Ainsi, un estimateur des m points de ruptures (m e IN*, suppose connu) de la 
longue memoire ou d'auto-similarite est congu pour un echantillon de processus gaussien en se basant sur 
I'analyse par ondelettes, ce qui permet ensuite de mettre en place des tests d'adequation. Pour ce type 
de processus, cette methode a ete proposee pour la premiere fois dans [8], puis developpee par exemple 
dans [1] . La convergence des estimateurs bases sur les ondelettes a ete etudiee dans le cas du mouvement 
brownien fractionnaire (FBM) dans [3], et dans un cadre semi-parametrique general de processus gaus- 
siens stationnaires a longue memoire par [15] et [5]. 

Ici le principe de I'estimation du paramctrc dc longue memoire ou d'auto-similarite est le suivant : dans 
chaque zone ou il n'y a pas de changement, ce parametre pent etre estimc a partir d'unc log-log regres- 
sion de la variance des coefficients d'ondelcttcs sur plusieurs echelles clioisics (voir (1)). Une fonction de 
contraste definie par la somme des carres des distances entre ces points et les droites d'ajustement, dans 
les m -I- 1 zones possibles detectees, est minimisee (voir (2)), donnant un estimateur des points de ruptures 
(voir (3)). Sous ccrtaines hypotheses gcncralos, on montre qu'il vcrifie un theoreme limite avec une vitesse 
de convergence cxplicite (voir Theorem 1.1). Dans chacune des zones dctcctccs, les parametres de longue 
memoire, (ou d'auto-similarite) peuvent etre estimes, tout d'abord avec la regression des moindrcs carres 
ordinaires (OLS), puis par une regression des moindres carres pseudo-generalises (FGLS). Un theoreme 
de la limite centrale est etabli pour chacun des deux estimateurs (voir Theorem 1.2 et Proposition 1.1 
ci-dessous) et des intervalles de confiance peuvent 6tre calcules. L'estimateur FGLS offre deux avantages : 
d'une part, sa variance asymptotique est plus petite que celle de l'estimateur OLS, et d'autre part, il 
permet la construction d'un test d'ajustement tres simple base sur le carre des distances entre les points 
(d'abscisse, le logarithme d'une echelle choisie et d'ordonnee, le logarithme de la variance empirique des 
coefficients d'ondelettes pour cette echelle) et les droites de regression pseudo-generalisee correspondantes 
(voir (8)). La convergence vers une distribution du Chi-deux de ce test est etablie dans le Theorem 1.3. 
Deux cas particuliers de processus gaussiens sont ensuite etudies dans la section 2. En premier lieu, on 
s'interesse aux series chronologiques stationnaires longue memoire avec un parametre de Hurst constant 
par morceaux. On se place dans un cadre semi-parametrique contenant par exemple les FGN et les pro- 
cessus FARIMA (voir Figure 1). En second lieu, le cas d'un processus a accroissements stationnaires et 
autosimilaire par morceaux est traite, ce qui revient a considerer des successions de FBM ayant des ex- 
posants de Hurst distincts (voir Figure 1). Pour ces deux exemples, les vitesses de convergence explicites 
des differents estimateurs et tests sont donnees et des simulations montrent leurs qualites (voir Table 1). 
D'autres simulations, preuves des theoremes ainsi qu'un exemple plus general de detection de ruptures 
dans le cadre de processus gaussien localement fractionnaire sont detailles dans [6]. 

1. Assumptions and main results 

Let (Xt)tg]isi be a Gaussian process and assume that [Xq, Xi, . . . , Xjv) is known. In the sequel, X will 
be a picccwisc stationary long memory time scries or a pieccwisc self-similar time series having stationary 
increments. Consider ^ :M, ^M. a. function called "the mother wavelet". For (a, b) € x R, the wavelet 
coefficient of X for the scale a and the shift b is dx{a, b) := /j^ 'ilj{^^)X{t)dt. When only a discretized 
path of X is available, approximations ex{a,b) are only computable: , 




for (a, 6) e R!;. X M. 
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Assume that there exist m, ^ IN (the number of abrupt changes) and 

• — Tq < tI < . . . < < T^+i = 1 (unknown parameters); 

• two famihes (a*)o<j<m G K.™^^ and {P*)o<j<m G (0,00)™+^ (unknown parameters); 

• a sequence of "scales" {an)nen € K,"^ (chosen) satisfying a„ > amin, with amin > 0, 

such that for j = 0, 1, . . . , m and k e D%{j) C [[Nt*], [NT*_^_l]] , 

E[e3f(aAr,A;)] - /3*(oAr)"^* when N ^ 00. 
A piecewise sample variance can be the appropriated estimator of such power law. Thus, define 

[fe'/ajv]-l 

k'-k 



^kiaN):=^^^ J2 ej,{aN,aNp) for < < A;' < TV. 



p=[fc/a«] 

Now set < n <...< ri with £ G M* , and assume that a multidimensional central limit theorem can 
be established for ^ log (S'^' (ri a^))^ when [Nt*] < k < k' < [A^t*_,_i], i.e. 



k'-k 



(log(5^'(ria;v)) -log(/3*)-a*log(ria;v))^^.^^ Af{0,r^'\a*,n,...,re)), (1) 

I k' —k\ — »oo 

with r^'^^ (a*, ri, . . . , r^) — {jpq)^^,^ a (^x£) matrix not depending on such that a i-^ r^-^^(a, n, . . . ,r^) 
is a continuous function and a positive matrix for all a. Define a contrast function 

mi 2 
UN{{aj)o<j<m, {f3j)o<j<m, ikj)i<j<m) = X! X! {^'^^ i^k^^^ i^i "-n)) - (ttj log(ri Oat) + log/3j)) (2) 

with {aj)o<j<m e ^'"+' C (/3i)o<i<m e 5"+' C (0,00)™+!, = fco < fcl^< ■ • ■ < < fcm+1 = 

A'', (A;j)i<j<TO e Km{N) C IN™. The vector of estimated parameters dij, Pj and kj (and therefore 7^) is 
the vector which minimizes this contrast function in A™~^^ x B™~^^ x Km{N), i.e., 

((Sj)o<j<m, iPj)o<j<m, ikj)i<j<m) := Argmin|j7jv((aj)o<j<m, {l3j)o<j<m, (fcj)i<j<m)} (3) 

Tj:=kj/N for 1 < j < m. (4) 

For a given (fcj)i<j<„i, it is obvious that iaj)o<j<m and (log/3j)o<j<m are obtained from a log-log 
regression of (5*^,^'^^ (r^ ajv)) ^ onto [I'idN^^, i.e. 

''log(riajv) 1^ 

= (L;.ii)-iLi.y,f ^ with ^:=( log (S,^;+^(r,.a;v)) , ^a. := : : • 

\ log(r£ ajv) 1 / 

Therefore (kj)\<j<m = Argmin|[/Ar ((Sj)o<j<m, i^j)o<j<m, ikj)i<j<m), {kj)i<j<m S Km{N)Y 

Remark 1 In this paper, m is supposed to be known. However, if m is unknown, as in [12] or [13], a 
penalized contrast U.rn,N = Un+0n x fn (with Pn an appropriated sequence converging toO) can be used 
instead ofUN, and by adding a minimization in m, an estimator fh of m could be also deduced. 

In this paper, parameters (a*) are supposed to satisfied abrupt changes: 

Assumption C : Parameters (a*) are such that |q!*_|_i — a* | ^ for all j = 0, 1, . . . , to — 1. 




Theorem 1.1 Define t* := {t* ,t^), t := (n, . . . , fm) and \\T\\m '■= '^slx. (^\ti\, . . . ,\Tm\) ■ Let £ G 
IN \ {0, 1,2}. // Assumption C and relation (1) holds with (Q:*)o<j<m such that a* G [a , a'] and a < a' 

for all j = 0, . . . ,m, then if a]y^^^° °)jV~^ — > 0, for all {vn)n satisfying Vn-cl]^'^^"' "■*iV~^ — > 0, 

N^oo N^oo 

f(vn\\t* - zWm >v) — ^ for all t] > 0. (5) 

\ / AT— ^oo 

Remark 2 The proof of this result is provided in [6]. Unfortunately, the rate of convergence of ||t* — £||m 
is only = with < a < 1 and not N as, for instance, in [12] and [13]. However the context is not 
the same: in these papers, the contrast is directly computed from N values of {X)i which do not change 
following r. Here, the contrast is computed from only [m + 1)1 values of Sn which change following r. 
The rate of convergence N can not be reached in such a context (simulations show also this property). 
This is certainly a drawback of your method, which hopefully does not change the rate of convergence of 
parameters {aj) and {/3j). 

For J = 0, 1, . . . , m, the log-log regression of (S^^'''^(r,ajv))j^<.<^ onto (riaAr)i<i<^ provides estimators of 

a* and /JJ. However, if tj converges to rj, kj = N-tj does not converge to kj, and therefore P([fcj, fcj+i] C 
[kj, fc^+i]) does not tend to 1. So, define kj and kj such that kj = kj + and kj = fcj+i — From (5) 
with r? = 1/2, P([L,fc']c — > 1. Then, 

J JT^ / JV— >(X) 

Theorem 1.2 LetS* := ) andQj := {LiL-i)~ L^Y- ' = ). Under the same assumptions 

Mog/3*^ Mog/?/ 

as in Theorem 1.1, for j = 0, . . . ,m, with Tj'^^\a*j,ri, ...,re) ■= {l[Li)~^ L[r^^^ {Uj ,ri, . . . ,ri)Lx{L\Li)~'^ , 

\/^^%^(^^-«.-) „-f:,A^(o,s«'(«;,n,....,)) (6) 

A second estimator of G* can be obtained from feasible generalized least squares (FGLS) estimation. 
Indeed, the asymptotic covariance matrix T'^^\a*^ri, . . . ,r() can bo estimated by the matrix V^^^ :— 
r(j)(a,-, ri, ...,re) and f^^) r(-?)(a* n, ...,re). Then, the FGLS estimator 6,- of 6* is defined from 

JV— >(x) ■' 

the minimization among all 6 of the following squared distance, 

II 1^ - . e ||?<,,= (1^ - L„„e)' . (f w)-^ . (1^ - L„„e). 

and therefore define 6^ := (Li(fW)"^Li)"^Li(fW)"V^*'^ 

Proposition 1.1 Under the same assumptions as in Theorem 1.2, for j = 0, . . . ,m 



(e, - e-) A'(o. n ..... r,)) 



(7) 



with M^^\a* ,ri, . . . ,re,) := (ij^ (F*^^^ (a* , ri, . . . , r^)) ^Li) ''' < T,^^\a* ,ri, . . . ,ri) (for the order's rela- 
tion between positive symmetric matrix). 

Therefore cij is more accurate than aj for estimating a* when A'' is large enough. For j = 0, . . . , m, let 
T^^ be the FGLS distance between points ( log(ri un), log (-S-O ) and the FGLS regression line. The 
following theorem describes the asymptotic behavior of a goodness-of-fit test on each segment [kj,k'j[: 



Theorem 1.3 Under the same assumptions as in Theorem 1.1, for j = 0, . . . ,m 

TU) = II Y^^ - L„„e, II?,, x'{i - 2). (8) 

2. Applications 

2.1. Piecewise long memory Gaussian processes 

Assume that the process X = {Xt)f^f^ is a Gaussian piecewise long-range dependent (LRD) process, i.e. 
there exists {D*)o<j<m e (0, 1)^+^ and for all j = 0, . . . , m and € {[Nt*], [Nt*] + 1, . . . , [Nt*^^] - l}, 

Xk = xl^}^^^,-^, where X^^^ = {xl^^)tQK satisfies the following Assumption LRD(£>p. 

Assumption LRD(D): F is a centered stationary Gaussian process with spectral density / such that 
/(A) = \X\-" ■ /*(A) for all A e [-7r,7r] \ {0} with /*(0) > and with C2 > 0, |/*(A) - /*(0)| < 
C2-|A|2 for all AG [-7r,7r]. 

Following [5], if the mother wavelet is supposed to be included in a Sobolev ball, then 

Corollary 1 Let X be a Gaussian piecewise LRD process defined as above and ip : R IR &e [0, 1]- 
supported with i^{0) = '4'{l) = and ip{t) dt = and such that there exists sequence {tpe)eei. satisfying 
V'(A) = E^ez V'^e^'^'^^ e 1L2([0, 1]) and E€ez(l + \^\Y^^^\H < 00. Under Assumption C, for all < k < 
2/15, if UN = iV«+V5 and vn = 7V2/5-3« ^5^^ ^gj^ ^gj j^^i^ 

Thus, the rate of convergence of r to r* (in probability) is N'^/^^^'^ for < k arbitrary small. Estimators 
Dj and Dj converge to the parameters D*^ following a central limit theorem with a rate of convergence 
^2/5-k/2 Jqj. q < ^ Convincing results of simulations can be observed in Table 1 and Figure 1. 



Vmse 



N = 20000 


AT = 5000 


N = 10000 


ri:0.75 Do:0.2 Di:0.8 


ri:0.,3 r2:0.78 i?o:0.6 Hi:0.8 i?2:0.5 


Tl 

0.7540 


Do 
0.1902 


Di 
0.7926 


Tl 

0.3465 


T2 

0.7942 


Hq 
0.5578 


Hi 
0.7272 


H2 
0.4395 


Tl 

0.3086 


T2 

0.7669 


Ho 
0.5597 


Hi 
0.7633 


Hi 
0.4993 


0.0215 


0.0489 


0.0761 


0.1212 


0.1322 


0.0595 


0.0837 


0.0643 


0.0893 


0.0675 


0.0449 


0.0813 


0.0780 


0.0218 


0.0499 


0.0764 


0.1298 


0.1330 


0.0730 


0.1110 


0.0883 


0.0897 


0.0687 


0.0604 


0.0892 


0.0780 



Table 1 

Left: Estimation of n, Do and Di in the case of piecewise FARIMA(0,(ij,0) (di = 0.1 and 6,2 = 0.4) with one change point 
when N = 20000 (50 realizations). Right: Estimation of n, n, Hq, Hi and H2 in the case of piecewise FBM with two 
change points when N = 5000 and N = 10000 (50 realizations) 

2.2. Piecewise fractional Brownian motions 

Now, X will be called a piecewise fractional Brownian motion if there exist two famihes of param- 
eters {H*)o<j<rn e (0,1)™+^ and {<7*^)o<j<m e (0,00)"'+^ such that for all j = 0,...,m and t e 
[[Nt*], [Nt*] 1, . . . , [Nt*^-^] -l], Xt= X^^J^j^^,-^, where X^^') = {X^^'>)ten is a FBM with parameters 
H* and a*^. Following the results of [3], one obtains. 

Corollary 2 Let X be a piecewise FBM and ^ : M, ^ M, be a piecewise continuous and left (or right)- 
differentiable, such that is Riemann integrable with il)'{t~) the left-derivative of V' in t, with 

support included in [0, 1] and tP^{t) dt = tPiJj{t) d,t = for p = 0,1.. Let A := | sup^ H* — infj H* | . 
If A < 1/2, under Assumption C, for allO<K< j^-^, if an = N^'^+'^ andvN = N'^/HT--2A)-k(2+aa) 
then (5), (6), (7) and (8) hold. 
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Remark 3 The dependence of this result on A can be explained by the fact that 2(supj a* — infj a*) + 
1, with a* — 2H* + 1, has to be smaller than 3 since ■ N^^/^ — > oo. However, Corollary 2 is 

quite surprising: the smaller A, i.e. the smaller the differences between the parameters Hj , the faster the 
convergence rates of estimators tj to t* . If the difference between two successive parameters Hj is too 
large, the estimators Tj do not seem to converge. This is attributable to the influence of the other segments 
that is even deeper than the involved exponents are different (simulations exhibit this paroxysm in [6]). 

Thus, the rate of convergence of t to r* (in probabiHty) can be N^/H'^-^^)-'^ for < k' as small as one 
wants when oat — Afi/3+« /(2+4A)_ j^gsults of simulations can be observed in Table 1 and Figure 1 in a 
case where A — 0.3 < 1/2. 




Figure 1. Left: Piecewise FARIMA(0,(ij,0) (with do : 0.1 (Do : 0.2), di : 0.4 {Di : 0.8) and n : 0.75). Right: Piecewice 
FBM(J^j ) (n : 0.3, T2 : 0.78, Ho : 0.6, Hi : 0.8 and H2 : 0.5), (n : 0.32, ^2 : 0.77, Ho ■ 0.5608, Hi : 0.7814 and H2 ■ 0.4751). 
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